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METHOD, SYSTEM, AND PROGRAM FOR FILTERING 
CONTENT USING NEURAL NETWORKS 

Related Applications 

5 This application is a continuation of U.S. Patent Application No. 09/478,925, 

filed on January 6, 2000, which patent application is incorporated herein by reference in 
its entirety. 

BACKGROUND OF THE INVENTION 
10 1. Field of the Invention 

The present invention relates to a system, method, and program for filtering 
content from an incoming data stream being accessed by a viewer program using a neural 
network to inhibit access to material deemed unacceptable. 

15 2. Description of the Related Art 

Although the Internet is heralded as a revolutionary technology capable of 
empowering people by providing relatively easy access to a tremendous store house of 
information, it is also derided for allowing unrestricted access to material considered 
highly inappropriate for children, such as pornography, hate literature, and violence. 

20 Many lawmakers and various interest groups have called for government censorship of 
the Internet because of the availability of such controversial material. One other concern 
with the Internet has been the ability of children to engage in inappropriate 
communications with adults on IRC chat lines that not only exposes children to 
inappropriate material, but also facilitate inappropriate encounters between children and 

25 adults. Businesses that network their employee computers to the Internet are also 
concerned that employees spend time on non-business related Internet activity. Thus, 
businesses are also interested in controlling which Internet sites their employees may 
visit to prevent unproductive employee Internet access. 
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The response of the computer industry to these concerns has been the 
development of filtering software, such as Net Nanny by Net Nanny LTD., which is 
described in the "Net Nanny User Guide" (Copyright Net Nanny LTD., 1997). Such 
prior art filtering software is capable of monitoring Internet software, such as browsers, 
5 e-mail and IRC chat room applications and also other application programs, such as word 
processing and image viewing applications, e.g., Adobe Photoshop, Corel Photo-Paint, 
and Microsoft Paint. Current filtering software operates by providing a list of Internet 
sites and words that could be deemed inappropriate to children. To monitor Internet 
sites, the filtering software developer provides user lists of inappropriate web sites and 

10 other Internet sites, such as newsgroups and IRC chat lines, that its in-house researchers 
have located and deemed inappropriate for a particular age group of children. The 
purchaser of the filter program, which may be a parent, library, school, etc, may then set 
the filter software to deny access to all the Internet sites on the provided list. 

The current techniques for identifying undesirable web sites rely on the filter 

1 5 developer to provide thorough research on web sites and timely updates. However, there 
are an immeasurable number of Internet web sites, many of which cannot be located 
through traditional search engines. Thus, researchers may miss numerous Web sites 
containing inappropriate material that are not accessible through traditional search 
techniques. Further, web sites are being added and removed all the time. Thus, a child 

20 would have access to inappropriate sites that are added to the Internet between updates of 
the list. 

The prior art method for filtering documents is to scan the document for words on 
a "hit list" of unacceptable words, and deny access to those documents containing a word 
on the hit list. This prior art technique for identifying inappropriate documents may 
25 screen or deny access to numerous acceptable documents. For instance, if the word 
"breast" is on the list, then the filtering software would deny access to documents 
describing cooking recipes for chicken breast or for medical documents on such 
important issues as breast cancer. In fact, many have criticized the use of word based 
lists as a filtering tool because it often screens out much useful educational information. 
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For instance, if a list included common hate terms or a discussion of hate groups by a 
civil rights or anti-hate group, then such filtering software could conceivably deny access 
to deemed appropriate anti-hate literature. 

Businesses are often interested in limiting not only what employees cannot 
5 access, like the child filtering product, but also limiting what they can access. For this 
reason, businesses may use firewall software that would prevent their employees from 
accessing any Internet sites except those specifically sanctioned or on an approved list. 
However, such techniques may be problematic because the employee could be barred 
from readily accessing a work-related site that is not on the approved list, such as a site 
10 linked to or mentioned from a site on the approved list. 

Thus, there is a need in the art for an improved technique for filtering Internet and 
computer readable material for inappropriate content, while at the same time providing 
flexibility to allow access to educational material that may otherwise include words or 
phrases that are often associated with inappropriate material. 

15 

SUMMARY OF THE PREFERRED EMBODIMENTS 
To overcome the limitations in the prior art described above, preferred 
embodiments disclose a method, system, and program for filtering a data object for 
content deemed unacceptable by a user. A data object requested by a viewer program is 

20 received. The data object is processed to determine predefined language statements. 
Information on the determined language statements is inputted into a neural network to 
produce an output value. A determination is then made as to whether the output value 
indicates that the data object is unacceptable. Viewer program access to the data object is 
inhibited upon determining that the data object is unacceptable. 

25 In further embodiments, a determination is made as to whether the output value 

indicates that the data object requires further consideration. If so, information on the data 
object is logged. User input on a rating indicating the acceptability of the data object is 
received and the neural network is trained to process the logged data object as input to 
produce the received rating as output. 



4 IBM Docket No. TU999060US2 

Firm N0.OOI8.OO66C 

The data object may comprise any incoming data stream, such as data from a 
document or one of multiple packets that form a document requested by the viewer. 

In yet further embodiments, inhibiting the viewer program access to the data 
object upon determining that the data object is unacceptable comprises blocking access to 
5 the entire data object. Alternatively, a determination may be made of language 

statements in the data object that are unacceptable. In such case, inhibiting the viewer 
program access to the data object upon determining that the data object is unacceptable 
comprises blocking access to the unacceptable language statements and allowing access 
to language statements not determined to be unacceptable. 

10 The preferred embodiments provide a filter program that is capable of considering 

more factors of a document than current filter software that would block access based on 
the presence of a single unacceptable word. The preferred filter program is capable of 
considering all unacceptable words as well as acceptable content words. The preferred 
embodiment filter program utilizes neural networks to train the filter to recognize when a 

1 5 document includes a level of inappropriate language that exceeds an acceptability 
threshold. Using neural networks allows the filter to consider documents that are not 
identical to the ones used to train the network; yet what was learned during training is 
applied to determine whether to inhibit access to a different document. The preferred 
filter program is an improvement over current art because it is less likely to deny access 

20 to acceptable documents that may otherwise include unacceptable words, whereas current 
filtering programs would reject a document upon the occurrence of a single specified 
unacceptable word. 

Preferred embodiments that filter packets of data, tvnicallv transmitted over the 
Internet, are an improvement over current filter art utilizing lists of unacceptable Internet 
25 web sites because the preferred filter program will inhibit access to unacceptable Internet 
web sites that were not located by searchers creating the list of prohibited web sites or 
added to the Internet between updates of the list. The preferred filter program also 
reduces the cost of producing the filter software because the developer does not have to 
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maintain a full-time staff of employees searching for inappropriate web sites to add to the 
lists of prohibited sites. 

Further, in certain embodiments, only the inappropriate statements may be 
blocked without affecting other statements in the document or incoming data streams if 
5 inappropriate material is posted. For instance, if a child is at an acceptable children 
oriented IRC chat room and someone starts posting obscene or other inappropriate 
material, only those incoming data streams that contain offensive material would be 
blocked, without affecting the ability to receive appropriate incoming data stream content 
from the site. With such embodiments, only access to the unacceptable content is 

10 blocked and incoming data streams with acceptable content are allowed through. 

Moreover, preferred embodiments allow for continual consideration of Internet 
sites and documents that are not clearly acceptable or unacceptable by allowing the 
administrator, such as the parent or employer, to retrain the neural network filter program 
to provide a specific accept or deny outcome for such documents that do not fall within 

15 the unacceptable and acceptable ranges. This allows the administrator to tailor the neural 
network and provides even greater likelihood that access will not be inhibited to 
acceptable material including some inappropriate language. 

BRIEF DESCRIPTION OF THE DRAWINGS 
20 Referring now to the drawings in which like reference numbers represent 

corresponding parts throughout: 

FIG. 1 is a block diagram illustrating a computing environment in which preferred 

viiiuvuuiivmo uiv iiiij/i vii ivn ivvj, 

FIG. 2 illustrates a neural network used to implement the filter program in 
25 accordance with preferred embodiments of the present invention; 

FIG. 3 illustrates logic to filter packets of a document in accordance with 
preferred embodiments of the present invention; 

FIG. 4 illustrates logic to retrain the filter program in accordance with preferred 
embodiments of the present invention; and 
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FIG. 5 illustrates logic to filter a document in accordance with preferred 
embodiments of the present invention. 

DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS 
5 In the following description, reference is made to the accompanying drawings 

which form a part hereof and which illustrate several embodiments of the present 
invention. It is understood that other embodiments may be utilized and structural and 
operational changes may be made without departing from the scope of the present 
invention. 

10 FIG. 1 illustrates a computing environment in which preferred embodiments are 

implemented. A computer system 2 includes a plurality of viewer programs 4a, b, c and 
a filter program 6. The computer system 2 may comprise any computing device known 
in the art, such as a personal computer, mainframe, workstation, server, hand-held 
computing device, etc. The viewer programs 4a, b, c may comprise application programs 

15 that enable users to view and/or transmit content, such as an HTML Web browser, 
newsgroup readers, word processing programs, image viewers (e.g., Adobe Acrobat), 
etc., and communication software that allows person-to-person communication over the 
Internet (e.g., chat room software, AOL Messenger, ICQ, etc.), The filter program 6 is 
capable of filtering content requested by the viewers 4a, b, c in order to inhibit access to 

20 material deemed undesirable. 

The viewers 4a, b, c are capable of accessing data from the Internet and/or a 
storage system 8, which may comprise one or more hard disk drives, tape storage, CD- 
ROM, diskette, electronic memory, or any other storage or memory medium known in 
the art. The term Internet 10 as used herein refers to the World Wide Web as well as any 

25 other network or Intranet to which the computer 2 has access. In preferred embodiments, 
the Internet 10 uses a communication protocol, such as TCP/IP, that transmits data in 
packets. 

In alternative network embodiments, the filter program 6 may be included within 
a gateway computer or proxy server through which multiple computers in the same Local 



7 IBM Docket No. TU999060US2 

Firm N0.OOI8.OO66C 

Area Network (LAN) or wide area network (WAN) access the Internet. In such case, the 
filter program 6 would filter content for all networked computers accessing the Internet 
through the gateway computer on which the filter program is installed. In this way, the 
users at the client computers cannot disable or affect the operations of the filter program 
5 6 on the proxy server or gateway computer. 

FIG. 2 illustrates a feed forward, back propagation neural network 20 in which 
the preferred embodiment filter program 6 is implemented. The feed forward 
back-propagation network 20 has an input layer, an output layer, and at least one hidden 
layer. Each layer is fully connected to the succeeding layer, which means that each node 

10 in one layer fully connects to each node of the following layer, i.e., each input node 

connects to every hidden layer node and each hidden layer node connects to every output 
node. In accordance with feedforward neural networks known in the art, a unique 
weighting value is applied to data flowing from one node into another node. The value 
of a hidden layer node is equal to the sum of data at all the weighted input nodes that 

15 connect to the hidden layer node. Likewise, each output node is equal to the sum of data 
at all the weighted hidden layer nodes that connect to the output layer node. As shown 
in FIG. 2, the sum result of all the operations is a single output value, which is preferably 
between 0 and 1 . In preferred embodiments, the number of hidden nodes is equal to the 
number of input and output nodes divided by two and rounded up to the nearest whole 

20 number. 

In preferred embodiments, the input nodes of the neural network 20 receive 
information on words included in the document or packet being analyzed and from this 
generate the output value from 0 to 1 . The developer would train neural networks 20 for 
different age groups. Before the training, the developer would determine a rating for 
25 training documents between zero and one. The rating would be based on general 

standards of appropriateness for the protected user. The term "protected user" as used 
herein refers to a child, employee or other person whose access to material will be 
inhibited and "administrator" refers to the purchaser of the product who wants to inhibit 
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the protected user's access to content deemed inappropriate, such as the parent, school 
administrator, employer, etc. 

The filter program 6 may specify the output values that would be deemed 
acceptable, e.g., values between 0 and 0.3, values deemed in a grey area requiring further 
5 consideration by the administrator, e.g., values between .3 and .6, and values deemed 
unacceptable for the protected user, e.g., values between .6 and 1.0. The values for each 
of the three range categories, acceptable, grey area requiring further consideration, and 
unacceptable may be adjusted by the administrator. For instance, if the administrator did 
not want to spend time resolving incoming data streams receiving output values in the 
10 grey area, then the administrator would set the grey area requiring further consideration 
to a very small range. On the other hand, if the administrator wants to tailor the network 
and is concerned about screening out too much, then the grey area range would be 
expanded to provide the administrator more control over how the neural network 20 is 
trained. 

1 5 During operation, the filter program 6 would search the training document or 

packet for instances of certain predetermined words, such as unacceptable words and 
acceptable content words. For instance, the filter program 6 could search for sexual 
words and medical or health related words that may indicate the sexual word is part of an 
otherwise acceptable educational document on health or medicine. The filter program 6 

20 would then input information on the instances of the predetermined words at the input 
nodes of the neural network 20 to produce an output value within one of the ranges. 

During training, if the output value from the training document does not have the 
rating assigned by the developer, then the developer will use back propagation neural 
network training techniques known in the art to train the neural network 20 to produce 

25 the desired output value for that document. The back propagation learning process to 
train the network to produce a specific outcome based on a specific input is performed in 
iterative steps: information determined from one of the training documents or packets, 
such as the number of instances of unacceptable words, is applied to the 
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input nodes of the neural network 20, and the network produces some output based on the 
current state of it's synaptic weights (initially, the output will be random). This output is 
compared to the developer selected rating, and a mean-squared error signal is calculated. 
The error value is then propagated backwards through the network, and small changes are 
5 made to the weights in each layer. The weight changes are calculated to reduce the error 
signal for the training document and assigned rating in question. The whole process is 
repeated for each of the training documents, then back to the first case again, and so on. 
The cycle is repeated until the overall error value drops below some pre-determined 
threshold. When the error level has fallen below the threshold, the neural network 20 

10 has learned the problem sufficiently. In alternative embodiments, neural network 
training algorithms known in the art other than back propagation may be utilized. 

The filter program 6 developer would repeat this training process for numerous 
packets or documents having different levels of acceptability to train the network for a 
wide range of different types of packets files, e.g., acceptable health material discussing 

15 sexual material, unacceptable material discussing sex, etc. By training the neural 
network for a wide range of packets or documents, the likelihood is increased that the 
neural network 20 would produce an accurate rating for documents that are not identical 
to the training documents. Thus, with the preferred embodiments two documents from 
different Internet web sites that are similar in content but different in presentation would 

20 likely produce the same acceptability rating. 

To determine the input to the input nodes of the neural network 20, the filter 
program 6 would search for certain inappropriate words or phrases, e.g., sexual 
references, non-work related terms that are usually inappropriately searched by 
employees (e.g., sports scores, stock quotes, news, etc), hate words, etc., and content 

25 words, such as words indicating a medical or health slant to the packet or file, and count 
the number of such inappropriate or content words in the packet or file. Each input node 
of the neural network 20 would correspond to the percentage of instances of one 
inappropriate or content word or phrase in the document. Thus, a weighted presence of 
the word or phrase is used. For instance, the presence of one inappropriate word in a 
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large document would have a relatively low weighting, whereas the present of one or two 
inappropriate words in a one line sentence could have a significantly high weighting. 
The filter program 6 would then input the percentage of instances of a specified word on 
the corresponding node for that word or phrase. In this way, the neural network 20 
5 would base its output value decision on the percentage of instances of a word in the 
document that was on the predetermined list of words to consider. In this embodiment, 
there would have to be a node for each word in the word list. 

In alternative embodiments, the list may group words or phrases according to 
categories or classes of language, e.g., swear words, hate words, sexual references, etc. 

10 There would then be a node for each category of words. In such case, the input to the 
input node layer would be the percentage of instances of a word or phrase of a particular 
category in the packet or document being considered. In further embodiments, 
additional nodes may be provided to indicate word proximity to provide a further level of 
detail in the input information that is used to generate the output rating of the input 

15 packet or file. 

In preferred embodiments, the filter program 6 would have to maintain a list of 
predetermined inappropriate and content words on which to search. However, unlike 
prior art filtering programs, the presence of a particular predetermined inappropriate 
word will not cause the automatic rejection of the file. Instead, all inappropriate words 

20 and content words will be considered together in the neural network 20 to determine the 
acceptability of the document based on how the network 20 was trained to handle such 
documents. Thus, the preferred embodiment neural network 20 would consider the 
context of the words and not just issue a "knee jerk" reaction based on the presence of a 
few inappropriate words. For instance, the presence of one swear word and numerous 

25 references to the First Amendment of the United States Constitution may produce a 
neural network 20 acceptable rating for a filter program 6 trained for older children in 
accordance with the preferred embodiments, whereas such content may produce an 
unacceptable determination in prior art filtering software. 
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In certain embodiments, access may be inhibited by denying access to only those 
portions that include acceptable material, but allowing access to other portions that do 
not include unacceptable material. For instance, if the filter program 6 considers each 
packet transmitted over the Internet, the filter program 6 may allow those yielding 
5 acceptable output values to flow to the viewer 4a, b, c, while blocking access to packets 
producing unacceptable output values. In this way, access to an entire HTML document 
or communication will not be blocked, but only unacceptable parts. Thus, a child logged 
onto an appropriate children chat room will not be screened out from all chat room 
communications due to the presence of some person presenting an inappropriate 

10 message. The preferred embodiments would just screen out the inappropriate portions. 
In cases of filtering an entire document, the unacceptable phrases may be blocked out 
while allowing access to other parts of the document. 

FIG. 3 illustrates logic implemented in the filter program 6 to screen packets 
transmitted over the Internet 10. Control begins at block 100 with the filter program 6 

15 receiving a packet on a communication port connected to the Internet 10 destined for a 
viewer 4a, b, c, such as an HTML browser, chat room program software, etc. The filter 
program 6 would process (at block 102) the packet and a predetermined list of words or 
phrases, i.e., language statements, to locate words in the document that are on the list and 
the number of occurrences of such listed words or phrases. For each located listed word 

20 or phrase in the packet, the filter program 6 would determine the percentage (at block 
104) of the determined number of instances of such word in the document as a whole by 
dividing the number of instances of the listed word or phrase by the total number of 
words in the document. This weighted or percentage number of instances of the listed 
word or phrase (at block 105) would then be placed on the input node of the neural 

25 network 20 corresponding to that word. As discussed, there may be a node for each 
listed word or phrase or for categories of words or phrases. Further, there may be 
additional nodes indicating the proximity of listed words. After inputting the 
information on the listed words or phrases included in the document or proximity 
information to the neural network nodes, the filter program 6 executes (at block 106) the 
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neural network 20 to forward the data at the input nodes to the hidden layer nodes and 
then to the output layer node, applying the weights determined during the learning 
process. The output of this operation is an output rating between 0 and 1. 

The filter program 6 then determines (at block 108) whether this output rating 
5 falls within the acceptable range. If so, the filter program forwards (at block 1 10) the 
packet to the viewer program 4a, b, c unchanged. Otherwise, if the output rating is not 
acceptable, then the filter program 6 determines whether the output rating is within the 
unacceptable range. If so, then the filter program 6 issues (at block 1 14) an error 
message. At this point, the error message could cause the filter program 6 to block the 

10 display of the entire document of which the packet is a part, or the filter program 6 could 
block the packet producing an unacceptable rating and allow other packets to proceed to 
the viewer 4a, b, c. If the output rating is not unacceptable, then it would fall within the 
further consideration category. In such case, the filter program 6 would log (at block 
116) the content of the packet to a journal, and generate an error message to handle the 

15 unacceptable packet in the manner handled at block 114. 

In embodiments where the filter program 6 only inhibits access to packets or 
portions of the document including unacceptable words or phrases, access is maximized 
and protected users are not blocked from otherwise acceptable material that receives an 
unacceptable rating. At the same time, the protected user is shielded from those words 

20 that are deemed offensive. This is an improvement over current Internet filtering 

techniques that bar access to specific sites as any site, including those deemed generally 
acceptable, may be occasionally bombarded with inappropriate content. Thus, unlike the 
prior art filters, the preferred embodiments shield users from inappropriate content that 
appears on an otherwise acceptable site, such as if someone posts inappropriate messages 

25 in a child oriented chat room. 

FIG. 4 illustrates logic implemented in the filter program 6 to allow an 
administrator to set ratings for packets or documents that are rated for further 
consideration, i.e., neither acceptable nor unacceptable. Control begins at block 150 
with the initiation of a retraining routine to process those packets and or documents 
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entered in the log requiring further consideration. This may be initiated by the 
administrator selecting through the filter program 6 graphical user interface (GUI) to 
retrain logged entries that yielded neither acceptable nor unacceptable ratings. The filter 
program 6 then displays (at block 152) the content of each packet logged at block 1 16 in 
5 FIG. 3. A GUI tool (not shown) would be provided to allow the administrator (parent) to 
assign a rating to the packet indicating its acceptability or unacceptability. 

The filter program 6 then receives as input (at block 154) the administrator's 
selection of ratings assigned to logged packets. The filter program 6 then begins a loop 
at block 158 for each logged packet provided a new rating. For a selected logged packet, 

10 the filter program 6 determines (at block 160) the rating the user assigned to such packet. 
The filter program 6 then retrains the neural network 20, preferably using back 
propagation neural network training techniques known in the art, to adjust the weights 
applied between nodes to yield the rating assigned to the selected packet. To accomplish 
this, numerous iterations of the neural network 20 would be performed (at block 162) to 

15 adjust the weights until the output rating converged to the user assigned rating within an 
error threshold. Control then proceeds (at block 164) back to block 158 to consider any 
further selected logged packets. After training, the neural network 20 is configured to 
assign the administrator assigned rating to the packet. In this way, the administrator 
maintains the flexibility to retrain the network and the permissible content for new 

20 documents that are not rated clearly acceptable or unacceptable. This allows the filter 
program 6 to dynamically be adjusted for new documents and files. 

FIG. 5 illustrates logic implemented in the filter program 6 to filter the content of 
a document being retrieved from storage 8 that is not transferred in packets, such as the 
case with the Internet 10 TCP/IP protocol. FIG. 5 is similar to FIG. 3 except that the 

25 entire document is screened and then any located words are passed through the neural 
network 20 to determine whether the document as a whole is acceptable or unacceptable. 
As with packets, when issuing the error message at block 214, upon determining that a 
document is unacceptable or needs further consideration, the filter program 6 could deny 
access to the entire document or filter out unacceptable words or phrases to display those 
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portions of the document not including offensive languages. If a document was deemed 
acceptable, then no words, including those that are unacceptable, would be screened out. 
Further, the administrator may have an opportunity to retrain the neural network 20 to 
assign a specified rating to those documents logged at block 216 in FIG. 5, as with 
5 packets. 

Preferred embodiments provide a more flexible approach to filtering inappropriate 
content than current filtering software techniques which merely screen for words. With 
preferred embodiments words are considered in context along with the number of 
occurrences of such words to assign a rating that is similar to what was assigned to the 

10 closest input document or packet used to train the neural network 20. By training the 
neural network with a wide range of document types, the neural network 20 is more 
likely capable of assigning an accurate ratings to different documents falling within the 
range of training documents or packets. The preferred approach attempts to provide a 
more realistic balance to filtering that takes into account other considerations of a 

15 document beyond the mere presence of an offending term or its point of origin, e.g., 
source Web site of term, to reduce the likelihood that appropriate and educational 
material is blocked. 



Conclusion 

20 The preferred embodiments may be implemented as a method, apparatus or article 

of manufacture using standard programming and/or engineering techniques to produce 
software, firmware, hardware, or any combination thereof. The term "article of 
manufacture" (or alternatively, "computer program product") as used herein is intended 
to encompass one or more computer programs and/or data files accessible from one or 

25 more computer-readable devices, carriers, or media, such as magnetic storage media, 
"floppy disk," CD-ROM, optical disks, holographic units, volatile or non-volatile 
electronic memory, etc. Further, the article of manufacture may comprise the 
implementation of the preferred embodiments in a transmission media, such as a network 
transmission line, wireless transmission media, signals propagating through space, radio 
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waves, infrared signals, etc. Of course, those skilled in the art will recognize many 
modifications may be made to this configuration without departing from the scope of the 
present invention. 

In preferred embodiments, the filter program 6 would search for listed or 
5 predefined words or phrases or word categories. The administrator may add, remove, or 
modify the predefined words/word categories on which searches are performed. If the 
administrator makes such changes, then the neural network 20 must be adjusted to 
remove or add input nodes and hidden layer nodes to accommodate the changes in the 
words/word categories upon which searches are performed. After the neural network 20 

10 is adjusted, then the nueral network would have to be retrained using training documents 
to produce the desired output values. 

Preferred embodiments provided specific architecture and node arrangements for 
the feed forward, back propagating neural network 20. However, those skilled in the art 
will appreciate that alternative node arrangements, including the use of more or fewer 

15 layers of nodes, may be used with the described neural network architecture. 

In summary, preferred embodiments disclose a method, system, and program for 
filtering a data object for content deemed unacceptable by a user. A data object 
requested by a viewer program is received. The data object is processed to determine 
predefined language statements. Information on the determined language statements is 

20 inputted into a neural network to produce an output value. A determination is then made 
as to whether the output value indicates that the data object is unacceptable. Viewer 
program access to the data object is inhibited upon determining that the data object is 
unacceptable. 

The foregoing description of the preferred embodiments of the invention has been 
25 presented for the purposes of illustration and description. It is not intended to be 

exhaustive or to limit the invention to the precise form disclosed. Many modifications 
and variations are possible in light of the above teaching. It is intended that the scope of 
the invention be limited not by this detailed description, but rather by the claims 
appended hereto. The above specification, examples and data provide a complete 
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description of the manufacture and use of the composition of the invention. Since many 
embodiments of the invention can be made without departing from the spirit and scope of 
the invention, the invention resides in the claims hereinafter appended. 



